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Claims 

[cl ] sq \ 1 . A method for automatically generating a query, comprising 

^^\^ef\n\ng an organized classification of document content With each class in the organized 
classification of document content having associated the/ewith a classification label; each 
classification label corresponding to a category of information in an information retrieval 
system; 

identifying a set of entities in selected document content for searching additional 
information related thereto using the information retrieval system; 
assigning the selected document content a classification label from the organized 
classification of content; 

automatically formulating a query that restrict* a search at the information retrieval 
system for information concerning the set oyentities to the category of information in the 
information retrieval system identified by tbfe assigned classification label. 

2. The method according to claim 1 , further comprising limiting the query by adding 
terms relating to context information surrounding the set of entities in the selected 
document content. 

3. The method according to claim ^ wherein the number of terms added is limited to a 
predefined number. 

4. The method according to cla/m 2, further comprising limiting the query by adding 
terms defining the assigned classification label. 

5. The method according to/claim 1 , wherein the organized classification of document 
content is defined using aynierarchical organization. 

6. The method according to claim 1 , further comprising using a text categorizer to assign 
the classification labelAssigned from the organized classification of content. 

7. The method acco/ding to claim 6, further comprising: 

extracting with thtftext categorizer a set of terms relating to the document content; and 
appending to thyquery ones of the set of terms extracted by the text categorizer to 
contextualize tbfe query. 

[c8] 8. The method according to claim 7, further comprising abbreviating the set of terms 

extracted b/ the text categorizer to a predefined limit of terms. 
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9. The method according to claim 8, wherein said abbreviating comprises: 
extracting noun phrases from the selected document content; 

ranking the noun phrases by those that occur most frequently in tl?fe document content; 
defining a subset of noun phrases by identifying those ranked nj6un phrases that occur 
more frequently than a first predefined frequency; 

ranking those words in the subset of noun phrases by theiryfrequency of occurrence to 
define an ordered list of words; 

defining a subset of the ordered list of words by identiwing those ranked words that 
occur more frequently than a second predefined freqyency; 

re-ranking the subset of words in inverse frequency to their use in the category of 
information in the information retrieval system id/ntified by the assigned classification 
label; 

using only those highest ranked words in the r^-ranked subset of words to define the set 
of terms appended to the query. 

1 0. The method according to claim 1 , wh^/ein each class in the organized classification 
of document content has associated therewith a characteristic vocabulary. 

1 1 . The method according to claim 1 further comprising ranking results from the query 
performed at the information retrieval system in accordance with one of the assigned 
classification label and the characteristic vocabulary. 

12. The method according to claim 1 1 , using the method in a system for enriching 
selected content of a document with personalities that identify enrichment themes. 

1 3. The method according/to claim 1 , further comprising automatically identifying the set 
of entities using a servic/ that recognizes entities of a predefined type. 



[cH] 

14. A system for automatically generating a query, comprising: 
an entity extractor/or identifying a set of entities in selected document content for 
searching information related thereto using an information retrieval system; 
a categorizer far defining an organized classification of document content with each class 
in the organisation of content having associated therewith a classification label; each 
classification label corresponding to a category of information in the information retrieval 
system; tlie categorizer assigning the selected document content a classification label 
from t^e organized classification of content; 
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a query generator for automatically formulating a query that restricts a search at/he 
information retrieval system for information concerning the set of entities to the category 
of information in the information retrieval system identified by the assigned/Classification 
label. 

1 5. The system according to claim 14, further comprising a short lenorfi aspect vector 
generator for generating terms relating to context information surrounding the set of 
entities in the selected document content; wherein the query generator adds the terms 
relating to the context information to limit the query. 

16. The system according to claim 15, wherein the query generator further limits the 
query by adding terms defining the selected classification label provided by the 
categorizer. 

1 7. The system according to claim 1 6, further comprising a content manager for 
enriching the selected document content with results provided from the information 
retrieval system using the query. 

1 8. An article of manufacture for use in a computer system, comprising: 
a memory; 

instructions stored in the memory for $/perating a method for automatically generating a 
query, comprising: 

defining an organized classif icatior/of document content with each class in the organized 
classification of document contertt having associated therewith a classification label; each 
classification label corresponding to a category of information in an information retrieval 
system; 

identifying a set of entities/in selected document content for searching information 
related thereto using the /information retrieval system; 

assigning the selected document content a classification label from the organized 
classification of content; 

automatically formulating a query that restricts a search at the information retrieval 
system for information concerning the set of entities to the category of information in the 
information retneval system identified by the assigned classification label. 



1 9. The article of manufacture according to claim 1 8, wherein the instructions stored in 
the memory further comprise limiting the query by adding terms relating to context 
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information surrounding the set of entities in the selected docum 




20. The article of manufacture according to clahffl9, wherein the instructions stored in 
the memory further comprisejurther limiting the query by adding terms defining the 
assigned classificatiojvKDel. 
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